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Abstract 

The Category Game is a multi-agent model that accounts for the emergence of 
shared categorization patterns in a population of interacting individuals. In the 
framework of the model, linguistic categories appear as long lived consensus states 
that are constantly reshaped and re-negotiated by the communicating individuals. 
It is therefore crucial to investigate the long time behavior to gain a clear under- 
standing of the dynamics. However, it turns out that the evolution of the emerging 
category system is so slow, already for small populations, that such an analysis 
has remained so far impossible. Here, we introduce a fast no-rejection algorithm for 
the Category Game that disentangles the physical simulation time from the CPU 
time, thus opening the way for thorough analysis of the model. We verify that the 
new algorithm is equivalent to the old one in terms of the emerging phenomenology 
and we quantify the CPU performances of the two algorithms, pointing out the 
neat advantages offered by the no-rejection one. This technical advance has already 
opened the way to new investigations of the model, thus helping to shed light on 
the fundamental issue of categorization. 
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1 Introduction 



The Category Game (CG) is a computational model in which a population of 
individuals co-evolve their own system of symbols and meanings by playing 
elementary language games [1]. It has been introduced to investigate how 
categorization can emerge from scratch in a group of individuals who interact 
in a pairwise way without any central coordination. The reference problem 
is color categorization, which is a central issue both in linguistics [2] and in 
cognitive science [3,4,5]. Color naming represents in fact a fundamental access 
point to human cognition, and at the same time provides important clues 
on language evolution. The evolution of English color categories constitutes 
an excellent example. English color terms exhibited a gradual semantic shift 
from largely brightness color concepts (Old English) to almost exclusively hue 
concepts (Middle Enghsh) [6]. The World Color Survey, moreover, showed 
that color systems across language are not random [7,8], but rather exhibit 
certain statistical regularities, thus opening the way to a revolution in cognitive 
science [5,9]. 

The main point of interest of the CG is that it is able to reproduce quali- 
tatively and, most remarkably, quantitatively the empirical data gathered in 
the WCS [10]. It also differs from the other models defined to address similar 
issues [1,10,11,12,13,14,15,16,17,18] in that it accounts for the categorization 
of a genuinely continuous perceptual channel and it describes a categorization 
pattern as a continuously evolving metastable state on which the population 
shares a temporary consensus [1,19]. The latter characteristic is intriguing and 
underlies the existence of a new framework to address the puzzling problem 
of language change, which turns out to be at the same time propelled by the 
interaction among the speakers and impeded by the need of these speakers 
to understand each other [19]. The presence of this sort of frustration renders 
the dynamics of the model so slow that it has been so far impossible to in- 
vestigate and quantify properly the details of the long time behavior, even for 
small population sizes. 

Here we present a fast algorithm suitable for studying the CG dynamics over 
large timescales and for moderately large population sizes. The algorithm is, 
in spirit, similar to those suggested for accelerating Monte Carlo simulations 
(see [20,21] and also [22] for other examples), where the key ingredient is to 
avoid rejection steps (hence the "no-rejection" tag). Of course, however, the 
dynamics we are referring to is substantially different, so new methods had 
to be developed in order to tackle the specific aspects of the model under 
consideration. The rest of the paper is organized as follows. In section 2, we 
outline a detailed description of the CG model. The next section presents the 
fast algorithm suitable for investigating the long-time CG dynamics. Section 4 
compares the outcomes of the fast algorithm with the original one, showing 
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excellent qualitative as well as quantitative agreement. In this context, a de- 
tailed investigation of a set of relevant observables is also performed, shedding 
new light on the "microscopic" origin of the "macroscopic" behavior of the 
system. In section 5 we show the computational complexity of the proposed 
fast algorithm, compared with the one of the original model. We conclude in 
section 6 by summarizing our contributions and pointing to possible future 
applications of this method. 



2 The Category Game model 

The Category Game model [1] is crafted to examine how a population of inter- 
acting individuals can develop through a series of language games [23] a shared 
form-meaning repertoire from scratch without any pre-existing categorization. 
We consider a population of N artificial agents each of them having, without 
any loss of generality, a one-dimensional continuous perceptual space span- 
ning the [0, 1) interval. A categorization can be identified as a partition of this 
space into discrete sub-intervals which we shall denote from now onwards as 
perceptual categories. Each agent has a dynamical inventory of form-meaning 
associations linking the perceptual categories (meanings) to words (forms). 
The perceptual categories as well as the words associated to them co-evolve 
over time through a series of simple communication interactions (or "games"). 

In each game, two individuals are randomly selected from the population and 
one of them is assigned the role of a speaker while the other the role of a 
hearer. Both the speaker and hearer are presented with a scene of M > 20 
stimuli (objects) where each stimulus corresponds to a real number in the [0, 1) 
interval. By definition, no two stimuli appearing in the same scene can be at 
a distance closer than d^i^ which is the only parameter of the model encoding 
the finite resolution power of any perception (for instance, the human Just 
Noticeable Difference). 

One of the objects is the topic of the communication. The task of the speaker 
is to communicate this to the hearer using the following prescription. The 
speaker utters a word associated with the topic while the hearer tries to guess 
its meaning from the word she "listened" . The speaker always checks whether 
the topic is the unique among the presented stimuli to lie in the correspond- 
ing perceptual category. If it is not, i.e., if the two stimuli collide on the same 
perceptual category, then a new boundary is created in the perceptual space 
at a location corresponding to the middle of the segment connecting the two 
stimuli. A new word is invented for each of the resultant two new categories. In 
addition, both of them inherit all the words corresponding to the old category. 

^ Without any loss of generality in all our simulations we shall use M = 2. 
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This process is termed as discrimination. Subsequently, the speaker utters the 
"most relevant" name for the category corresponding to the topic where the 
most relevant name is either the one used in a previous successful communi- 
cation or the newly invented name in case the category has just been created 
due to a discrimination. For the hearer, there can be the following possibilities: 
(a) the hearer does not have any category associated with the name, in which 
case the game is a failure, (b) there are one or more categories associated with 
this name in the hearer's inventory. In this case, the hearer randomly chooses 
one of them. If the category chosen corresponds to that of the topic, the game 
is a success, otherwise it is a failure. 

Depending on the outcome of the game one or both the agents update their 
repertoires. In case of a failure, the hearer adds the word in her repertoire 
hnked to the category corresponding to the topic. In case of a success, this 
word becomes the most relevant name for the category corresponding to the 
topic for both agents and they remove all other competing words from their 
respective repertoires linked with this category. Note that if both the speaker 
and the hearer already have only the successful word in the corresponding 
category, the inventory of both of them remains unaltered after the game. 
This situation, as already mentioned in the introduction, corresponds to a 
rejection step of a Monte Carlo algorithm. The time t of the dynamics is 
simply measured as the number of games played by the agents. 

The CG dynamics results in the emergence of a hierarchical category structure 
comprising two distinct levels: a basic layer, responsible for the fine discrimi- 
nation of the perceptual space (i.e., the perceptual categories), and a second 
shared linguistic layer that groups together perceptions having the same name 
to guarantee communicative success (linguistic categories) . Note that while the 
number of perceptual categories is tuned by a parameter of the model and can 
be very large (of the order of l/rfmin)! the number of linguistic categories turns 
out to be finite and small, as observed in natural languages (see fig. 2). 



3 A fast algorithm for the Category Game 

The primary goal of developing a fast algorithm is to study the long time dy- 
namics of the model. In the original algorithm accessing such long timescales 
would be extremely costly because of the freezing of the dynamics into metastable 
states. In particular, there would be many games in which the two agents (i.e., 
the speaker and the hearer) would end up making no changes in the config- 
uration of their respective inventories. The time would then increase by one 
time step. The basic idea behind the current version of the algorithm is to 
overcome this freezing by imposing that each game produces an outcome, i.e., 
a change in one or both the inventories of the two agents that are playing. In 
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this case, the time t has to be properly rescaled to recover the frozen dynam- 
ics. Furthermore, the correct statistical frequency of the different games (the 
playing order of the pairs and the probability to play in a given region) has to 
be reproduced in the no-rejection version of the model. In order to do so, we 
need to calculate both the probability that a game produces an outcome and 
the individual probabilities of each possible way in which the outcome can be 
obtained. 

In particular, the main steps of the fast algorithm are the following: 

Choosing the pair of players: this is the most important part of the algo- 
rithm since one has to choose a pair such that their game will produce an 
outcome. Let us define Pontihj) = Poutij,i) as the probability that a game 
between the two players will produce an outcome. We shall shortly describe 
the detailed method for computing this probability. For the time being, let 
us say that each of the A^(A^ — 1)/2 possible pairs will be extracted according 
to this probabihtjIZI- 

Choosing the region for placing the topic: once the pair of players has 
been chosen, we impose that the game will produce an outcome. As we 
shall see below, an outcome can follow if the topic falls either in a mismatch 
region or in a soft match one (both of these terms will be defined shortly). 
Subsequently, we need to choose the region to place the topic proportional 
to the corresponding probabilities p^}p(i, j) (for the mismatch region) and 
Pou^'^i'^'^j) (fo^ the soft match region)[£j. 

Game: the game is performed in the selected region with the same rules as 
in the original CG algorithm summarized in section 2. 

Rescaling time: at the end of each game, time is increased by a factor 

l/Pout(«, j)- 

Note that, of course, each probability is also a function of time (we have not 
explicitly displayed this dependence so as to keep the notations as simple as 
possible; we shall maintain this implicit form from now onwards throughout 
the rest of the paper) . 



^ The actual probability with which each couple is extracted reads: 

Pontih j) / T,i'<j' Pout{i' , f)- 

^ Note that p^^{i,j) + Pout^'^ihj) — Poutihj)- If we want to consider the con- 
ditional probabilities of having a game in a soft match region or in a mismatch 
region given that the game has an outcome, we have to divide the above defined 
probabilities by Poutihj)- 
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3.1 Extracting the no-rejection regions 



In order to calculate the probabilities of interest, we introduce the following 
definitions: (i) a match region for the two playing agents is a region where 
both the agents have the corresponding linguistic category settled (i.e., with 
only one label) and a unique label for it; (ii) a mismatch region is any interval 
in [0, 1) that is not a match region. 

Let us now consider a game where a speaker-hearer pair and two stimuli are 
selectecj3- Of course, if the topic falls in a mismatch region an outcome is 
guaranteed. If the topic falls instead in a match region, the situation is more 
tricky. In this latter event, two cases are possible: either there is the necessity 
to discriminate (see section 2) for one or both the agents, in which case the 
game produces an outcome, or the repertoire of both the speaker and the 
hearer remains unchanged (the outcome is null). To distinguish between these 
two events, let us refine the definition of a match region in the following way: 
we will denote a match region to be a strict match region if the lengths of 
the corresponding perceptual categories of both the speaker and the hearer 
are shorter than drain- In this case no discrimination is possible, due to the 
finite resolution constraint, and the outcome of the game will be surely null. 
A match that is not strict shall be called a soft match. 



3.1.1 Probability of playing in a soft match region 

Since the soft match definition is based on the length of the underlying per- 
ceptual categories, it is natural to express the probability of having a game 
in that region as the sum of the probabilities of having a game in each of the 
underlying perceptual categories: 



where the sum is over all the perceptual categories spanning the soft match 
region and we assume the dependence on the agents i,j to be implicit. 

Thus, we have to calculate the probability Pa^^^^ that both the topic and 
the object fall in the perceptual category a which belongs to a soft percep- 
tual match region and the game produces an outcome. Two events have to be 
considered simultaneously: (a) the topic falls in the match region, and (b) the 



without any loss of generality we shall consider from now onwards the first stim- 
ulus as being the topic of the game. 
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object falls in the unio of the two perceptual categories under considera- 
tion (i.e., of the speaker and the hearer) so as to produce a discrimination and, 
thereby, an outcome. As we recall from the above, the two events considered 
here are not independent but correlated through (imin- Consequently, we need 
to integrate all the different possibilities of placing the topic and the object 
maintaining this correlation. In order to write the expression for Pa^'~'^'^, we de- 
fine the left and the right boundary, respectively, of the considered per- 
ceptual category of the speaker, and correspondingly for the hearer. Further, 



X 



l,r 



min(a;;^^, x/^^) 
= min fx™ 



and X"^' 

is shown in fig. 1. We finally obtain: 



max . 
r 

max 
r 



dmin). An example illustrating the above terms 
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Speaker 



a a a a a 



Length ofd. H 

a a a a 
Hearer i i i 




J^mn 



Union 



Fig. 1. An example to illustrate the different terms for computing Pc^^^^ . Accord- 



ing to this figure, (a) x™™ = min(xf,3;P) = xf, (b) x™™ = min(x^,x^) = 
(c) xY^^ = max(xf,x/^) = xf, (d) x™^^ = max(x^,x^) = x;^, (e) 
^max _ niax(xf,x/^ + drain) = xf (taking into account the length of (imin shown 
in the figure), and (f) = min(x^,xf — dmin) = x^ (taking into account the 

length of dmin shown in the figure). 



r J 



SOFT _ 



mm 



J dy J dz + J dy J dz 



Pa 



X:""^-X'"^^)(xP + rfmin) 



^ The union is the region in [0, 1) that belongs either to the perceptual category of 
the speaker or of the hearer. 
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3.1.2 Probability of playing in a mismatch region 

In order to obtain the probability of playing in a mismatch region, we have 
to consider once again the correlation due to the minimal distance dmin in 
the extraction of the topic and the object. The probability of producing an 
outcome by means of a game in a mismatch region equals the probability 
that the topic falls in there. In order to calculate this probability, we must 
distinguish the case in which the first extracted object is the topic from the 
case it is not. In the first case, where the topic is the first extracted object, 
the probability Pafo-st that it falls in a mismatch region is simply the length 
of that region. In the second case, where the topic is the second extracted 
object, we have to consider the fact that it should be chosen at least dmin far 
apart from the first object. This can be done in a similar way as described in 
the previous subsection for the calculation of the probability of playing in a 
soft match region and we shall call this probability Pa^cond- Since the first and 
second case occur with equal probability in the original model, we can write: 



MIS_1/ MIS , MIS N (o\ 
Pa — 2 V/^afirst ^ Pasecond) K'^) 

PoT = EPa"^^ (4) 
a 

where the sum is over all the perceptual categories spanning the mismatch 
region. 



4 Comparison of the observables from the two algorithms 



In this section, we show that this no-rejection version of the CG algorithm 

features the same dynamical properties of the original one. We consider all 
the relevant observables reported in [1] as well as new microscopic observables 
analyzed here for the first time. 



4-1 "Macroscopic" observables 



Here we look at (a) the average number of perceptual {uperc) and linguistic 
categories {nung) per individual, (b) the success rate and (c) the perceptual 
and linguistic overlap across the population as functions of time for the two 
models (original and fast) we are comparing. We show simulations for different 
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t/N 

Fig. 2. Perceptual and linguistic categories obtained from the old and the new 
algorithm for = 25, 50 and 100. Solid lines show results obtained from the old 
algorithm while broken lines indicate results obtained from the new algorithm. All 
the results are averaged over 30 samples. 

population sizes and a fixed dmin = 0.0143 which is equal to the average human 
JND [10]. 

Fig. 2 shows the average number of perceptual and linguistic categories per 
individual obtained from the old and the new algorithms respectively versus 
the number of games per player (i.e., t/N). It is apparent from this figure that 
the new algorithm is not only able to reproduce the same phenomenology but 
also the outcomes are very close to what is obtained from the old algorithm. 

Fig. 3 compares the success rate for the two algorithms versus t/N. The suc- 
cess rate is measured as the fraction of successful games over sliding time 
windows. The figure clearly indicates that the results from the two algorithms 
match (almost) accurately. 




The overlap function [1] measures the degree of alignment of the category 
boundaries of two agents (z, j) and is defined as follows: 

2 ^ 2E,.(^c^)' 

o = — — — - y with = 2 (5) 

where 1^ is the width of the category c, c, is one of the categories of the i*'* 
agent and is the generic category of the intersection set containing all of 

^ The average number of perceptual categories remains same across different pop- 
ulation sizes as long as the value of d^\^ is fixed. Therefore we only show one 
representative plot for the perceptual categories in fig. 2. 
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t/N 

Fig. 3. Success rate obtained from the old and the new algorithm for N = 25, 50 
and 100. Solid lines show results obtained from the old algorithm while broken lines 
indicate results obtained from the new algorithm. All the results are averaged over 
30 samples. 




the boundaries of the agents i and j. The function Oij returns a value propor- 
tional to the degree of alignment of the two category inventories reaching its 
maximum unitary value when they are perfectly aligned. The linguistic over- 
lap is defined as in eq. 5 however, only considering the boundaries between 
categories with different most relevant names. 

Fig. 4 shows the perceptual and linguistic overlap obtained from the old and 
the new algorithm versus t/N. Once again there is an excellent agreement 
between the results obtained from the two algorithms. 
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Fig. 4. Perceptual and linguistic overlap obtained from the old and the new al- 
gorithm for = 25, 50 and 100. Solid lines show results obtained from the old 
algorithm while broken lines indicate results obtained from the new algorithm. All 
the results are averaged over 30 samples. 
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4-2 "Microscopic" observables 

As pointed out in the previous sections, one of the main ingredients of the 
fast algorithm is the computation of the probabihty that at each time step 
a game between a randomly chosen pair of players together with a randomly 
selected topic in the interval [0, 1) would produce a non-null outcome. Here we 
compare the behavior of the original and the fast algorithm with respect to 
this property. In particular, in fig. 5(a) we show for the original algorithm the 
fraction of games, collected in time sliding windows, that produce a non-null 
outcome, while for the no-rejection algorithm the probability Poutihj) of the 
selected pair {i,j) averaged over the same time windows. In the inset, we also 
show the (representative) histogram of Pout for each possible pair of agents 
at three different points in time for a specific population size. It is evident 
that the histogram is strongly peaked, which means that Poutihj) is roughly 
similar for all pairs {i,j) thereby, allowing for an effective random choice over 
the pairs without altering the dynamics. 

Subsequently, in fig. 5(b), (c) and (d) we respectively compare the two algo- 
rithms in terms of the fraction of games, collected in time sliding windows, 
which ended up being played in a mismatch region, or in a match region, 
or where a discrimination process occurred. Once again, the results exhibit a 
remarkable qualitative as well as quantitative agreement. 



5 Comparison of the computational complexity of the two algo- 
rithms 

In this section, we compare the computational complexity of the two algo- 
rithms, i.e., we compare the computer tim^ (tc) in seconds required to com- 
plete a specific number of games per player for the old and the new algorithms. 
In addition, we give more extensive results for the computational complexity 
of the fast algorithm. 

Fig. 6 compares the computer time required to complete a specific number 
of games per player by the old and the new algorithm. Note that during the 
initial games the old algorithm takes lesser computer time than the new one. 
During this phase, almost all games produce an outcome and therefore the ad- 
ditional calculation of the probabilities required for the fast algorithm is not 
advantageous. However, as soon as the dynamics gets trapped in metastable 
states (at the onset of the plateau region in the number of linguistic categories 



We use the in-built clock() function of the GNU C library to estimate the value 
of te- 
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Fig. 5. Microscopic observables. (a) Probability of having an outcome. The solid 
lines show the average fraction of games collected in time sliding windows versus 
tjN for the old algorithm. The broken lines show the average values of pout in the 
same time sliding windows versus tjN for the new algorithm. The inset shows the 
(representative) histogram of pout for each possible pair of agents at three different 
points in time for N = 50. (b) The fraction of games, collected in time sliding 
windows, which ended up being played in a mismatch region for the old (solid lines) 
and the new algorithm (broken lines), (c) The fraction of games, collected in time 
sliding windows, which ended up being played in a match region for the old (solid 
lines) and the new algorithm (broken lines), (d) The fraction of games, collected 
in time sliding windows, where a discrimination process occurred for the old (solid 
lines) and the new algorithm (broken lines). All our results are for N = 25, 50 and 
100 and each of them are averaged over 30 samples. 

curve in fig. 2) the calculation of the probabilities turns out to be very advan- 
tageous and the new algorithm has a much higher velocity than the old one. 
Note that for larger population sizes (A^ = 200 and 400) we could manage to 
reach only a lower value oi t/N within a reasonable tc for the old algorithm. 
In the inset of fig. 6 the ratio of the two algorithms' execution time is shown, 
to better appreciate the advantage of the no-rejection one. 



In fig. 7, we present a further analysis of the scaling of tc with the popula- 
tion size for the no-rejection algorithm. In particular, we report two different 
quantities in this figure for = 200, 400 and 800: 

(i) Pout versus the rescaled number of games. Note that one needs to rescale t 
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Fig. 6. Computer time {tc) in seconds required to complete a specific number of 
games per player by the old and the new algorithm for N = 25, 50, 100, 200 and 
400. Solid lines show the computer time required by the old algorithm while broken 
lines indicate the computer time required by the new algorithm. The results are 
obtained on the ISl cluster with the following node specification: (a) Processor - 
INTEL XEON E5405 2.00GHz, (b) Cache - 6144KB, (c) ISA: 64-bit and (d) RAM 
- 8GB. The inset shows the ratio of tc for the new algorithm ([tc]new) to the tc for 
the old algorithm ([tc]oid) ^oi the same population sizes as in the main figure. 

by iV^/^ to collapse the curves. This dependence of the time of the dynamics 
on the population size has also been recovered in several other cases else- 
where [19,24] and usually indicates the time scaling to reach a consensus in 
the population. Clearly, the rate at which the values of Pout drop decreases 
with increasing t. 

(ii) the rescaled tc versus the rescaled number of games (i.e., as m 

case (i)). In order to collapse these curves, especially in the "large" t regime 
(featuring the long-time dynamics) one has to rescale tc by ty/N. In addition 
the entire factor is multiplied by a large constant A (~ 2 x 10^) in order to 
present a better visualization of the plots within the same figure as of Pout- 



The most important observation is that the long-time behavior is exactly 
similar to that of Pout which indicates that the computer time required is 
largely determined by the probability of outcomes. The inset of the same figure 
shows the amount of tc required to complete t = 5 x 10^, 5 x 10^ steps for the 
old and the new algorithm for different population sizes. In all the four cases, 
the curves can be nicely fitted using a function of the form: f{N) = /3y/N. 
This observation once again confirms the dependence: tc oc \/N. An important 
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Fig. 7. The scaling of tc with the population size for the new algorithm, (i) pout 
versus the rescaled number of games, t is rescaled by N^/'^ in order to collapse the 
curves, (ii) Rescaled versus the rescaled number of games (i.e., t/iV3/2 
tc is rescaled by t\/lV and then the entire factor is multiplied by a large constant 
A (~ 2 X 10^) in order to present a better visualization of the plots within the 
same figure as of Pout- ^ = 200,400 and 800. The results are obtained on the ISI 
cluster with the following node specification: (a) Processor - INTEL XEON E5405 
2.00GHz, (b) Cache - 6144KB, (c) ISA: 64-bit and (d) RAM - 8GB. The inset 
shows the amount of tc required to complete t = 5 x 10^, 5 x 10® steps for the old 
(continuous lines) and the new (dashed lines) algorithm for different population 
sizes. All the four curves can be fitted with a function of the form: f{N) = f5^/N . 
For the old algorithm (3 = 8.25 and 104.92 while for the new algorithm (3 = 2.16 
and 34.66 respectively for t = 5 x 10'' and t = 5 x 10*^. 



point to note is that the pre-factor /3 is significantly lower for the new algorithm 
as compared to the old one. The dependence of tc on a/ZV can be attributed to 
the time required by the different processes of the model (e.g., discrimination, 
inventory updates etc.). A detailed analysis of this dependence is out of the 
scope of the current paper and shall be presented elsewhere. 
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6 Conclusion 



In this paper we have introduced a no-rejection algorithm developed to study 
the long time behavior of the Category Game model. The original model [1] 

approaches one of the most important problems in linguistics - the emergence 
of linguistic categories - and was shown to reproduce both qualitatively and 
quantitatively experimental results reported in the WCS [10]. The two main 
innovative aspects of the model, with respect to previously proposed ones, 
are (i) the dynamical emergence of a discretization from a continuous per- 
ceptual space and (ii) the representation of the present category system as a 
long lasting metastablc state rather then an attractor of the dynamics. The 
last property has triggered the need for suitable methods to achieve the long 
time dynamics of the model. In particular, the observed dynamics was such 
that games which bring a modification of the state of the agents resulted pro- 
gressively more rare. The analysis of the behavior of the system for suitably 
large time and population sizes was for that reason practically impossible. The 
no-rejection algorithm presented here has been crucial to access the long time 
dynamical properties of CG, characterized by metastability and aging [19], and 
thus to shed light on so far unexplored part of the model, helping to clarify 
the nature of the consensus states that are established during the CG dynam- 
ics, with important consequences for the understanding of such phenomena as 
language change and language evolution. The no-rejection model we propose, 
despite being ad hoc for the CG, presents principles and methods that can be 
generalized for different agent based models and we beheve that that could 
trigger computationally less expensive methods suitable to investigate social 
phenomena. 



References 

[1] A. Puglisi, A. Baronchelli, and V. Loreto. Cultural route to the emergence of 
linguistic categories. PNAS, 105(23) :7936, 2008. 

[2] J.R. Taylor and J.R. Taylor. Linguistic categorization. Oxford University Press 
New York, 2003. 

[3] T.W. Deacon. The symbolic species: The co-evolution of language and the brain. 
Norton & Company, New York, USA, 1998. 

[4] H. Gardner. The Mind's New Science: A History of the Cognitive Revolution. 
Basic Books, New York, 1985. 

[5] George Lakoff. Women, fire, and dangerous things: What categories reveal about 
the mind. University of Chicago Press, Chicago, 1987. 



15 



[6] R.W. Casson. Color shift: evolution of English color terms from brightness 
to hue. In Hardin C. and MafR L., editors, Color categories in thought and 
language, pages 224-239. Cambridge University Press, Cambridge, 1997. 

[7] B. Berlin and P. Kay. Basic Color Terms. Berkeley: University of California 
Press, 1969. 

[8] P. Kay and T. Regier. Resolving the question of color naming universals. PNAS, 
100(15):9085-9089, 2003. 

[9] G.L. Murphy. The big book of concepts. Bradford Book, 2004. 

[10] A. Baronchelli, T. Gong, A. Puglisi, and V. Lorcto. Modelling the emergence 
of universality in color naming patterns. PNAS, 107:2403-2407, 2010. 

[11] T. Belpaeme and J. Bleys. Explaining universal color categories through a 
constrained acquisition process. Adap. Behv., 13(4):293-310, 2005. 

[12] J. Bleys, M. Loetzsch, M. Spranger, and L. Steels. The grounded colour naming 
game. In Proceedings Spoken Dialogue and Human-Robot Interaction workshop 
at the RoMan 2009 conference, 2009. 

[13] M. Dowman. Explaining Color Term Typology With an Evolutionary Model. 
Cog. Sci., 31(1):99-132, 2007. 

[14] K.A. Jameson and N.L. Komarova. Evolutionary models of color categorization. 

I. Population categorization systems based on normal and dichromat observers. 
Journal of the Optical Society of America A, 26(6):1414-1423, 2009. 

[15] K.A. Jameson and N.L. Komarova. Evolutionary models of color categorization. 

II. Realistic observer models and population heterogeneity. Journal of the 
Optical Society of America A, 26(6):1424-1436, 2009. 

[16] N.L. Komarova and K.A. Jameson. Population heterogeneity and color stimulus 
heterogeneity in agent-based color categorization. Journal of Theoretical 
Biology, 253(4) :680-700, 2008. 

[17] N.L. Komarova, K.A. Jameson, and L. Narens. Evolutionary models of color 
categorization based on discrimination. J. Math. Psych., 51(6):359-382, 2007. 

[18] L. Steels and T. Belpaeme. Coordinating perceptually grounded categories 
through language: A case study for colour. Behav. Brain Sci., 28(04): 469-489, 
2005. 

[19] A. Mukherjee, F. Tria, A. Baronchelli, A. Puglisi, and V. Loreto. Aging in 
language dynamics, preprint, 2010. 

[20] M. Kolesik, M. A. Novotny, and P. A. Rikvold. Extreme long-time dynamic 
monte carlo simulations for metastable decay in the d=3 ising ferromagnet. 
Int. Jour, of Mod. Phys. C, 14(1):121-131, 2003. 

[21] M. A. Novotny. Monte Carlo algorithms with absorbing Markov chains: Fast 
local algorithms for slow dynamics. Phys. Rev. Lett., 74(1), 1995. 



16 



[22] K. Binder. Applications of monte carlo methods to statistical physics. Rep. 
Prog. Phys., 60, 1997. 

[23] Ludwig Wittgenstein. Philosophical Investigations. (Translated by Anscombe, 
G.E.M.). Basil Blackwell, Oxford, UK, 1953. 

[24] A. Baronchelli, M. Felici, E. Caglioti, V. Loreto, and L. Steels. Sharp transition 
towards shared vocabularies in multi-agent systems. Journal of Statistical 
Mechanics, P06014, 2006. 



17 



